N86-30361

DEASEL: An Expert System for Software Engineering


by Jon D. Valett and Andrew Raskin 


ABSTRACT 

For the past ten years, the Software Engineering Laboratory [1]
(SEL) has been collecting data on software projects carried out
in the Systems Development Branch of the Flight Dynamics Division
at NASA's Goddard Space Flight Center. Through a series of
studies using this data, much knowledge has been gained on how
software is developed within this environment. Two years ago
work began on a software tool which would make this knowledge
readily available to software managers. Ideally, the Dynamic
Management Information Tool (DynaMITe) will aid managers in
comparison across projects, prediction of a project's future, and
assessment of a project's current state. This paper describes an
effort to create the assessment portion of DynaMITe.


1.0 Background

Assessing the state of a software project during development
is a difficult problem, but its solution contributes to the
success of the project. When a project's weaknesses are
determined early in its life cycle, problems can be dealt with
quickly and effectively. For the software manager to perform this
assessment, he needs easy access to detailed, accurate information
(knowledge) regarding past projects within the development
environment. He then combines this information with his own
knowledge of software engineering to assess a project's
strengths and weaknesses. The DynaMITe Expert Advisor
for the SEL (DEASEL) is the first version of an expert system
that attempts to simulate this process.

2.0 Developing and Using Rules 

DEASEL assesses an ongoing project by attempting to answer a
simple question such as "How is my project doing?" To answer
this question, DEASEL uses a knowledge base of rules for
evaluating software projects. This knowledge base consists
of rules derived from two sources: the SEL database and
experienced software managers. DEASEL uses these rules, along
with data on the project of interest, to give the manager a
relative rating of the quality of that project.


J. Valett 
NASA/GSFC 
1 of 21 



2.1 Corporate Memory 

Of course, a major effort in the development of the DEASEL
system was the actual collection of knowledge. To derive rules
from the corporate memory, former studies [2,3,4,5,6,7,8]
performed by the SEL were reviewed to find relationships that
affect the quality of a software project. Many studies of data
concerning the SEL environment have been done within the last
ten years, and they give some idea of the cause and effect of
technologies and methodologies on a software project.
Thus, relationships like "increasing tool use will increase
productivity" are found. Because of the interdependencies among
the items, the strength of each relationship must then be
determined. For example, many different factors may influence
productivity; therefore it must be determined which of these
have the most and which the least influence. This has been a
long and difficult process because of the amount of data and the
difficulty of determining what data is relevant to the
assessment process.

2.2 Knowledge from Software Managers 

The other source of knowledge is the experienced software
managers, who have certain "rules of thumb" they use to evaluate
a software project. They are questioned to obtain this
subjective information, which is then used along with the more
objective material to produce the knowledge base. Again, the
strengths of the relationships must be determined. The entire
process of collecting knowledge is long and difficult and has
only just begun for the DEASEL project.


2.3 Representing the Rules 

After a preliminary set of knowledge had been collected, thought
turned to how to actually represent this knowledge. The initial
work on knowledge representation for DEASEL was directed at using
standard expert system techniques, including if-then production
rules. But it soon became clear that knowledge regarding
the assessment of a software project's development is more
naturally represented in a different manner. In fact, the
overall conclusion drawn from an assessment is quite different
from that drawn by a traditional expert system. The difference
lies in the type of question answered by DEASEL. The traditional
medical expert system, such as the often cited MYCIN [9],
answers a question like "What disease does patient X have?"
Then, given some data on the patient, the system determines the
disease. DEASEL, on the other hand, must answer the question
"How is project X doing?" Thus, it must give a rating to the
system based on the facts given to it. The analogous question in
the medical domain would be "How is patient X's health?"

In order for DEASEL to answer the question "How is project X
doing?", it needs two different types of knowledge. The first
type of knowledge is the assertions which relate to the specific
project in question. This includes the facts known about the
project as it currently stands. The second type of knowledge is
the detailed representation of how different facts affect the
overall development process of a project. These are the more
general "rules" on what affects the quality of a software
project. These rules are set up based on the knowledge described
earlier from the data base and the software managers. They are
used to describe all of the factors which affect a software
project's quality and all the sub-factors that affect those
factors, etc. For this reason this system of knowledge
representation, which is unique to DEASEL, is called
factor-based. Each rule in the factor-based representation scheme
specifies a system and its factors (sub-systems) and the weight
(strength of the relationship) each factor has on the system.
Thus, between the specific assertions about the project and the
general rules concerning software development within the SEL
environment, DEASEL can rate a project.


2.4 An Example Rule

To explain how this rating process works, here is an example
rule from DEASEL's knowledge base:

The factors that affect Computer_Environment_Stability are

    1) Operating_System_Stability     .3
    2) Software_Tool_Stability        .2
    3) Hardware_Stability             .4
    4) Computer_Env_Proc_Stability    .1

The number associated with each factor is a weight, and the sum
of the weights must always total one. This rule states that the
four listed factors have an effect on the quality of the
Computer_Environment_Stability. The rule's weights indicate that
Hardware_Stability is the most important factor in the assessment
of Computer_Environment_Stability, while
Computer_Env_Proc_Stability is the least important factor.
DEASEL uses the ratings of all four factors to determine a rating
for Computer_Environment_Stability.
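
As an illustration only, a factor-based rule can be pictured as a
table mapping each factor to its weight. The paper does not give
DEASEL's internal representation, so the sketch below is an
assumption: the names RULES, check_weights, and
most_and_least_important are hypothetical, and only the factor
names and weights come from the example rule.

```python
import math

# Hypothetical encoding of the example rule: system -> {factor: weight}.
RULES = {
    "Computer_Environment_Stability": {
        "Operating_System_Stability": 0.3,
        "Software_Tool_Stability": 0.2,
        "Hardware_Stability": 0.4,
        "Computer_Env_Proc_Stability": 0.1,
    },
}


def check_weights(rules):
    """Raise ValueError if the weights of any rule do not total one."""
    for system, factors in rules.items():
        total = sum(factors.values())
        if not math.isclose(total, 1.0, abs_tol=1e-9):
            raise ValueError(f"weights for {system} total {total}, not 1")


def most_and_least_important(rules, system):
    """Return the factors with the largest and smallest weight."""
    factors = rules[system]
    return max(factors, key=factors.get), min(factors, key=factors.get)


check_weights(RULES)
print(most_and_least_important(RULES, "Computer_Environment_Stability"))
```

Checking the weight total when a rule is entered is one simple way
to enforce the constraint that the weights sum to one.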


2.5 Deriving Conclusions 

DEASEL's overall assessment process consists of trying to
assign a rating to each of the quality indicators specified via
the knowledge base. Obviously, just answering the question "How
is project X doing?" will not give the manager specific enough
information about his project. Therefore, the knowledge base
specifies the top level factors DEASEL should rate. Currently,
the knowledge base has four such quality indicators:
reliability, predictability, stability, and controlled
development. Thus DEASEL actually gives information (a rating)
on each of these four indicators, which gives the manager an
assessment of how his project is doing in these areas. In order
to rate these four factors, DEASEL must find the rules which
relate to these factors and assign a rating to these rules. That
is, DEASEL reaches a conclusion on what it believes is the rating
of these indicators. For DEASEL to do this, it must first reach
conclusions on the factors which affect these indicators. Of
course, these factors may have rules which specify their
assessment, so this process continues until all of the necessary
conclusions are reached.

DEASEL reaches conclusions in one of three ways:

1) The conclusion can be an assertion from the knowledge
base.

2) DEASEL can infer the conclusion based on other
conclusions and its rule base.

3) If both 1) and 2) fail, it can ask the user to supply
the conclusion.

The three types of conclusions combine to allow DEASEL to make
its assessment of the supplied quality indicators. The basic
process is to first find a rule for one of the quality indicators,
then to resolve all of the conclusions necessary to reach a
conclusion for that indicator. This process continues by
reaching conclusions in each of the three ways, until all the
conclusions are resolved.

To fully understand the rating process one must also
understand how these conclusions are reached. A conclusion is
reached when a rating has been assigned to a factor in the
knowledge base. A rating is defined as a number between zero and
one; the higher the rating, the better the factor's quality. A
rating of .5 would be average or normal. Note that the ratings
always indicate quality; for example, a rating of .7 for error
rate as a factor would indicate a lower than normal error rate.
In addition, every conclusion has an associated certainty. A
certainty is the probability that the conclusion's rating is
correct within some fixed delta. Currently, DEASEL sets delta at
0.1.

All three types of conclusions have both a rating and a
certainty. Type 1 conclusions are really the assertions
described earlier. Currently, the assertions are entered by
hand into the knowledge base. In the future this process will be
automated and will be done by the DynaMITe tool, via the SEL data
base. The certainties for these conclusions are generally very
high (around .9) because the ratings are basically comparisons
between real data and average or normal numbers. Conclusions of
type 2 are computed using the following formulae:

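\[
\text{Rating} = \sum_{i=1}^{n} \bigl( \text{Rating of factor}(i) \times \text{Weight of factor}(i) \bigr)
\]

\[
\text{Certainty} = \sum_{i=1}^{n} \bigl( \text{Certainty of factor}(i) \times \text{Weight of factor}(i) \bigr)
\]

where n is the number of factors in the rule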

Thus, a rule for a certain factor is given a conclusion by using
these formulae to calculate its rating and certainty. The schema
used here should look familiar to anyone with knowledge of
probability. In its typical application, however, each of the
factors in the system being rated must be independent. In the
complex and unfamiliar domain of software engineering, such an
assumption may be incorrect. Our computations could therefore be
slightly or grossly in error depending on how much the knowledge
base violates this constraint. Future DEASEL knowledge engineers
must keep this in mind when creating and modifying the rule base.

Type 3 conclusions are necessary when the system cannot use type
1 or type 2 conclusions. In order for the system to complete an
assessment it must have conclusions for all the factors in the
knowledge base. Since expert systems must deal with incomplete
knowledge, whenever DEASEL cannot reach a conclusion for a factor
it assumes a normal rating (.5) with a certainty of .2. Note
that the .2 is the probability that the rating will be correct
within + or - delta, which in effect makes for a meaningless
conclusion. Whenever DEASEL is forced to do this, it makes a
note to ask the user if the conclusion can be provided. Thus,
the user can later provide the answers to questions about the
incomplete knowledge. Once these questions are answered, DEASEL
gives the rating supplied by the user a certainty of 1.0.
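
The three kinds of conclusion can be sketched as a short recursive
procedure. This is a minimal illustration under stated
assumptions, not DEASEL's implementation: the function name
conclude, the dictionary layout, and every rating in the
assertions table are hypothetical. Only the rule weights, the
weighted-sum combination, the (.5, .2) default, and the 1.0
certainty for user answers come from the text.

```python
DEFAULT_RATING, DEFAULT_CERTAINTY = 0.5, 0.2  # type 3 fallback from the text


def conclude(factor, rules, assertions, pending):
    """Return (rating, certainty) for a factor.

    Type 1: use an assertion if one exists.
    Type 2: combine the sub-factors of a rule by weighted sum.
    Type 3: assume the default and note a question for the user.
    """
    if factor in assertions:
        return assertions[factor]
    if factor in rules:
        rating = certainty = 0.0
        for sub, weight in rules[factor].items():
            r, c = conclude(sub, rules, assertions, pending)
            rating += r * weight
            certainty += c * weight
        return rating, certainty
    if factor not in pending:
        pending.append(factor)  # remember to ask the user later
    return DEFAULT_RATING, DEFAULT_CERTAINTY


# Weights from the example rule; the assertions are made-up values.
rules = {
    "Computer_Environment_Stability": {
        "Operating_System_Stability": 0.3,
        "Software_Tool_Stability": 0.2,
        "Hardware_Stability": 0.4,
        "Computer_Env_Proc_Stability": 0.1,
    },
}
assertions = {
    "Operating_System_Stability": (0.6, 0.9),
    "Software_Tool_Stability": (0.5, 0.9),
    "Hardware_Stability": (0.8, 0.9),
}

pending = []
first = conclude("Computer_Environment_Stability", rules, assertions, pending)
print([round(x, 2) for x in first], pending)

# The user answers the pending question; user answers get certainty 1.0.
assertions["Computer_Env_Proc_Stability"] = (0.7, 1.0)
second = conclude("Computer_Environment_Stability", rules, assertions, [])
print([round(x, 2) for x in second])
```

With these made-up inputs the first pass works out to a rating of
0.65 with certainty 0.83, and one question is left pending.
After the user supplies a rating of 0.7, the rating becomes 0.67
and the certainty rises to 0.91, which shows why answering
questions leads to more certain conclusions.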


2.6 Current DEASEL Capabilities 


The capabilities of the current DEASEL system include
allowing the user to obtain an assessment of his project, if some
assertions exist for that project. After the initial assessment
is given, the user has three options: 1) asking for an
explanation, 2) answering questions about his project, and 3)
playing what-if games.

For any conclusion, the user can ask for an explanation of how
the conclusion was reached. The explanation consists of the
conclusions DEASEL reached about the factors of the original
conclusion. That is, the user is able to ask DEASEL what caused
it to reach any specific rating for any factor. This process can
continue as the user asks for explanations of the factors
previously reported on, and so on.

Earlier we mentioned that DEASEL makes a note of type 3
conclusions. The user may opt to answer these questions as he
wishes. He may also respond to the questions by indicating he
does not know the answer. In this case, DEASEL maintains the
meaningless conclusion reached earlier. Answering questions is
encouraged because it leads to more certain conclusions.

What-if games aid the manager in evaluating the effects of
changes he may wish to make in his project. This process allows
the user to enter controls into the system by actually changing
conclusions. That is, the user can see what will happen if he
changes certain conclusions in the knowledge base. After changing
one or more conclusions he can then reassess the project to
determine the effects of these changes. This is an important
feature of the DEASEL system, because it allows the manager to
determine how he might be able to improve his software project.
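
The explanation and what-if capabilities can be sketched in a few
lines. The sketch is an assumption for illustration, not the
DEASEL user interface: the names rate, explain, and what_if are
hypothetical, the ratings are made up, and certainties are left
out to keep it short.

```python
def rate(factor, rules, conclusions):
    """Weighted-sum rating; unknown leaf factors get the normal rating 0.5."""
    if factor in conclusions:
        return conclusions[factor]
    if factor in rules:
        return sum(w * rate(f, rules, conclusions)
                   for f, w in rules[factor].items())
    return 0.5


def explain(factor, rules, conclusions):
    """List (sub-factor, weight, rating) triples behind a rating."""
    return [(f, w, rate(f, rules, conclusions))
            for f, w in rules.get(factor, {}).items()]


def what_if(factor, rules, conclusions, changes):
    """Return the rating before and after overriding some conclusions.

    The original conclusions are left untouched.
    """
    trial = {**conclusions, **changes}
    return rate(factor, rules, conclusions), rate(factor, rules, trial)


# Weights from the example rule; the ratings are made-up values.
rules = {
    "Computer_Environment_Stability": {
        "Operating_System_Stability": 0.3,
        "Software_Tool_Stability": 0.2,
        "Hardware_Stability": 0.4,
        "Computer_Env_Proc_Stability": 0.1,
    },
}
conclusions = {
    "Operating_System_Stability": 0.6,
    "Software_Tool_Stability": 0.5,
    "Hardware_Stability": 0.4,
    "Computer_Env_Proc_Stability": 0.5,
}

for line in explain("Computer_Environment_Stability", rules, conclusions):
    print(line)

before, after = what_if("Computer_Environment_Stability", rules,
                        conclusions, {"Hardware_Stability": 0.8})
print(round(before, 2), round(after, 2))
```

In this made-up case, raising Hardware_Stability from 0.4 to 0.8
moves the overall rating from 0.49 to 0.65. Because hardware
carries the largest weight in the rule, it is the change with the
most leverage.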





3.0 Summary 

Although the current version of DEASEL does begin to attack
the problem of project assessment, much more work is needed to
make the system a useful tool. Three potential directions exist
for future work: adding to and verifying the rule base,
verifying the accuracy of the assessment process, and automating
the creation of the assertion portion of the rule base. All of
these areas will require time and effort to complete, but are
necessary for successfully determining the validity of this
project. Obviously, DEASEL is but an initial attempt at solving
the problem of automating the process of assessing the state of
an ongoing software project. DEASEL has, however, given some
insight into the problem and ways to solve it. Hopefully this
initial work will lead to techniques for solving the problem more
completely.





REFERENCES 


1. SEL-81-104, The Software Engineering Laboratory, D.N. Card,
   F.E. McGarry, G. Page, et al., February 1982

2. SEL-83-002, Measures and Metrics for Software Development,
   D.N. Card, F.E. McGarry, G. Page, et al., March 1984

3. The Software Engineering Laboratory: Relationship Equations,
   K. Freburger and V.R. Basili, May 1979

4. McGarry, F.E., Valett, J., and Hall, D., Measuring the Impact
   of Computer Resource Quality on the Software Development
   Process and Product, Proceedings of the Hawaiian International
   Conference on Systems Sciences, January 1985

5. D. Card, R. Selby, F.E. McGarry, et al., April 1985

6. SEL-82-004, Collected Software Engineering Papers: Vol. I,
   July 1982

7. SEL-83-003, Collected Software Engineering Papers: Vol. II,
   November 1983

8. SEL-85-003, Collected Software Engineering Papers: Vol. III,
   November 1985

9. Shortliffe, E.H., Computer-Based Medical Consultations: MYCIN,
   Elsevier, North Holland, New York, 1976





THE VIEWGRAPH MATERIALS FOR THE J. VALETT PRESENTATION FOLLOW





[Viewgraph: DEASEL: An Expert System]

[Viewgraph: DEASEL]

[Viewgraph: KEY ISSUES]

[Viewgraph: COLLECTING KNOWLEDGE (From Corporate Memory; From Software Manager)]

[Viewgraph: TO ANSWER THE QUESTION]

[Viewgraph: THE KNOWLEDGE BASE; legible item: Controlled Development]

[Viewgraph: THE RATING PROCESS: A Simple Example]

[Viewgraph: THE RATING PROCESS]

[Viewgraph: THE RATING PROCESS: A Simple Example]

[Viewgraph: PLANS]

[Viewgraph: KEY ISSUES]



